NOVEL HUMAN PROTEASES AND POLYNUCLEOTIDES ENCODING THE SAME

The present application claims the benefit of U.S. Provisional
Application Number 60/174,686 which was filed on January 6, 2000
and is herein incorporated by reference in its entirety.

1. INTRODUCTION
The present invention relates to the discovery, identification,
and characterization of novel human polynucleotides encoding
proteins sharing sequence similarity with mammalian proteases.
The invention encompasses the described polynucleotides, host
cell expression systems, the encoded proteins, fusion proteins,
polypeptides and peptides, antibodies to the encoded proteins and
peptides, and genetically engineered animals that either lack or
over express the disclosed sequences, antagonists and agonists of
the proteins, and other compounds that modulate the expression
or activity of the proteins encoded by the disclosed
polynucleotides that can be used for diagnosis, drug screening,
clinical trial monitoring and the treatment of physiological
disorders.

2. BACKGROUND OF THE INVENTION
Proteases cleave protein substrates as part of degradation,
maturation, and secretory pathways within the body. Proteases
have been associated with, inter alia, regulating development,
infertility, modulating cellular processes, fertility, and
infectious disease.

3. SUMMARY OF THE INVENTION
The present invention relates to the discovery, identification,
and characterization of nucleotides that encode novel human
proteins, and the corresponding amino acid sequences of these
proteins. The novel human proteins (NHPs) described for the
first time herein share structural similarity with animal
proteases, and particularly trypsin-like serine proteases such as
enteropeptidase (enterokinase), plasminogen, and acrosin.

The novel human nucleic acid (cDNA) sequences described herein
encode proteins/open reading frames (ORFs) of 217, 348, and 288
amino acids in length (see SEQ ID NOS: 2, 4, and 6 respectively).

The invention also encompasses agonists and antagonists of the
described NHPs, including small molecules, large molecules,
mutant NHPs, or portions thereof that compete with native NHPs,
NHP peptides, and NHP antibodies, as well as nucleotide sequences
that can be used to inhibit the expression of the described NHPs
(e.g., antisense and ribozyme molecules, and gene or regulatory
sequence replacement constructs) or to enhance the expression of
the described NHPs (e.g., expression constructs that place the
described sequence under the control of a strong promoter
system), and transgenic animals that express a NHP transgene, or
"knock-outs" (which can be conditional) that do not express a
functional NHP.

Further, the present invention also relates to processes for
identifying compounds that modulate, i.e., act as agonists or
antagonists, of NHP expression and/or NHP activity that utilize
purified preparations of the described NHP and/or NHP product, or
cells expressing the same. Such compounds can be used as
therapeutic agents for the treatment of any of a wide variety of
symptoms associated with biological disorders or imbalances.

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES
The Sequence Listing provides the sequences of the NHP ORFs
encoding the described NHP amino acid sequences. SEQ ID NO: 7
describes a NHP ORF with flanking sequences.

5. DETAILED DESCRIPTION OF THE INVENTION
The NHPs, described for the first time herein, are novel proteins
that are expressed in, inter alia, human cell lines, and human
testis cells. The described sequences were compiled from gene
trapped cDNAs and clones isolated from a human testis cDNA
library (Edge Biosystems, Gaithersburg, MD). The present
invention encompasses the nucleotides presented in the Sequence
Listing, host cells expressing such nucleotides, the expression
products of such nucleotides, and: (a) nucleotides that encode
mammalian homologs of the described sequences, including the
specifically described NHPs, and the NHP products;
(b) nucleotides that encode one or more portions of a NHP that
correspond to functional domains of the NHP, and the polypeptide
products specified by such nucleotide sequences, including but
not limited to the novel regions of any active domain(s);
(c) isolated nucleotides that encode mutant versions, engineered
or naturally occurring, of a described NHP in which all or a part
of at least one domain is deleted or altered, and the polypeptide
products specified by such nucleotide sequences, including but
not limited to soluble proteins and peptides in which all or a
portion of the signal sequence is deleted; (d) nucleotides that
encode chimeric fusion proteins containing all or a portion of a
coding region of a NHP, or one of its domains (e.g., a receptor
or ligand binding domain, accessory protein/self-association
domain, etc.) fused to another peptide or polypeptide; or
(e) therapeutic or diagnostic derivatives of the described
polynucleotides such as oligonucleotides, antisense
polynucleotides, ribozymes, dsRNA, or gene therapy constructs
comprising a sequence first disclosed in the Sequence Listing.

As discussed above, the present invention includes: (a) the human
DNA sequences presented in the Sequence Listing (and vectors
comprising the same) and additionally contemplates any nucleotide
sequence encoding a contiguous NHP open reading frame (ORF), or a
contiguous exon splice junction first described in the Sequence
Listing, that hybridizes to a complement of a DNA sequence
presented in the Sequence Listing under highly stringent
conditions, e.g., hybridization to filter-bound DNA in 0.5 M
NaHPO4, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and
washing in 0.1xSSC/0.1% SDS at 68°C (Ausubel F.M. et al., eds.,
1989, Current Protocols in Molecular Biology, Vol. I, Green
Publishing Associates, Inc., and John Wiley & Sons, Inc., New
York, at p. 2.10.3) and encodes a functionally equivalent gene
product. Additionally contemplated are any nucleotide sequences
that hybridize to the complement of the DNA sequence that encode
and express an amino acid sequence presented in the Sequence
Listing under moderately stringent conditions, e.g., washing in
0.2xSSC/0.1% SDS at 42°C (Ausubel et al., 1989, supra), yet still
encode a functionally equivalent NHP product. Functional
equivalents of a NHP include naturally occurring NHPs present in
other species and mutant NHPs whether naturally occurring or
engineered (by site directed mutagenesis, gene shuffling,
directed evolution as described in, for example, U.S. Patent
No. 5,837,458). The invention also includes degenerate nucleic
acid variants of the disclosed NHP polynucleotide sequences.

Additionally contemplated are polynucleotides encoding a NHP ORF,
or its functional equivalent, encoded by a polynucleotide
sequence that is about 99, 95, 90, or about 85 percent similar or
identical to corresponding regions of the nucleotide sequences of
the Sequence Listing (as measured by BLAST sequence comparison
analysis using, for example, the GCG sequence analysis package
using standard default settings).
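
The percent identity thresholds recited above can be illustrated
with a minimal Python sketch. It assumes the two sequences have
already been aligned to equal length (gaps written as "-"); it is
not BLAST and not the GCG package, and the function names
percent_identity and meets_threshold are hypothetical names
introduced only for this illustration.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of aligned columns holding the same residue.

    Assumes `a` and `b` are pre-aligned and of equal length.
    Columns containing a gap character ("-") never count as matches.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    if not a:
        raise ValueError("sequences must be non-empty")
    matches = sum(
        1 for x, y in zip(a.upper(), b.upper()) if x == y and x != "-"
    )
    return 100.0 * matches / len(a)


def meets_threshold(a: str, b: str, threshold: float = 85.0) -> bool:
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(a, b) >= threshold


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from the Sequence Listing.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
    print(meets_threshold("ACGTACGTAC", "ACGTACGTAA", 85.0))
```

A real comparison would first produce the alignment with an
alignment tool; the sketch only shows how a figure such as "about
90 percent identical" is computed once an alignment exists.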

The invention also includes nucleic acid molecules, preferably
DNA molecules, that hybridize to, and are therefore the
complements of, the described NHP nucleotide sequences. Such
hybridization conditions may be highly stringent or less highly
stringent, as described above. In instances where the nucleic
acid molecules are deoxyoligonucleotides ("DNA oligos"), such
molecules are generally about 16 to about 100 bases long, or
about 20 to about 80, or about 34 to about 45 bases long, or any
variation or combination of sizes represented therein that
incorporate a contiguous region of sequence first disclosed in
the Sequence Listing. Such oligonucleotides can be used in
conjunction with the polymerase chain reaction (PCR) to screen
libraries, isolate clones, and prepare cloning and sequencing
templates, etc.

Alternatively, such NHP oligonucleotides can be used as
hybridization probes for screening libraries, and assessing gene
expression patterns (particularly using a micro array or
high-throughput "chip" format). Additionally, a series of the
described NHP oligonucleotide sequences, or the complements
thereof, can be used to represent all or a portion of the
described NHP sequences. An oligonucleotide or polynucleotide
sequence first disclosed in at least a portion of one or more of
the sequences of SEQ ID NOS: 1-7 can be used as a hybridization
probe in conjunction with a solid support matrix/substrate
(resins, beads, membranes, plastics, polymers, metal or
metallized substrates, crystalline or polycrystalline substrates,
etc.). Of particular note are spatially addressable arrays
(i.e., gene chips, microtiter plates, etc.) of oligonucleotides
and polynucleotides, or corresponding oligopeptides and
polypeptides, wherein at least one of the biopolymers present on
the spatially addressable array comprises an oligonucleotide or
polynucleotide sequence first disclosed in at least one of the
sequences of SEQ ID NOS: 1-7, or an amino acid sequence encoded
thereby. Methods for attaching biopolymers to, or synthesizing
biopolymers on, solid support matrices, and conducting binding
studies thereon are disclosed in, inter alia, U.S. Patent
Nos. 5,700,637, 5,556,752, 5,744,305, 4,631,211, 5,445,934,
5,252,743, 4,713,326, 5,424,186, and 4,689,405, the disclosures
of which are herein incorporated by reference in their entirety.

Addressable arrays comprising sequences first disclosed in SEQ ID
NOS: 1-7 can be used to identify and characterize the temporal
and tissue specific expression of a sequence. These addressable
arrays incorporate oligonucleotide sequences of sufficient length
to confer the required specificity, yet be within the limitations
of the production technology. The length of these probes is
within a range of between about 8 to about 2000 nucleotides.
Preferably the probes consist of 60 nucleotides and more
preferably 25 nucleotides from the sequences first disclosed in
SEQ ID NOS: 1-7.

For example, a series of the described oligonucleotide sequences,
or the complements thereof, can be used in chip format to
represent all or a portion of the described sequences. The
oligonucleotides, typically between about 16 to about 40 (or any
whole number within the stated range) nucleotides in length can
partially overlap each other and/or the sequence may be
represented using oligonucleotides that do not overlap.
Accordingly, the described polynucleotide sequences shall
typically comprise at least about two or three distinct
oligonucleotide sequences of at least about 8 nucleotides in
length that are each first disclosed in the described Sequence
Listing. Such oligonucleotide sequences can begin at any
nucleotide present within a sequence in the Sequence Listing and
proceed in either a sense (5'-to-3') orientation vis-a-vis the
described sequence or in an antisense orientation.
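
As a hedged illustration of how a sequence can be represented by a
series of overlapping or non-overlapping oligonucleotides in
either orientation, the following Python sketch tiles a sequence
into fixed-length windows and reports each window together with
its reverse complement. The function names, the default length of
25, and the default step of 10 are assumptions made for this
example; the input shown is a hypothetical placeholder, not a
sequence from the Sequence Listing.

```python
# Complement table for unambiguous DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the antisense (reverse-complement) strand, written 5'-to-3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def tile_oligos(seq: str, length: int = 25, step: int = 10):
    """Tile `seq` into oligos of `length` bases, advancing `step` bases.

    step < length gives partially overlapping oligos;
    step >= length gives non-overlapping oligos.
    Returns a list of (1-based start, sense oligo, antisense oligo).
    """
    if length < 1 or step < 1:
        raise ValueError("length and step must be positive")
    tiles = []
    for i in range(0, len(seq) - length + 1, step):
        sense = seq[i:i + length]
        tiles.append((i + 1, sense, reverse_complement(sense)))
    return tiles


if __name__ == "__main__":
    placeholder = "ACGTACGTAC"  # hypothetical toy sequence
    for start, sense, antisense in tile_oligos(placeholder, length=4, step=3):
        print(start, sense, antisense)
```

Choosing a step smaller than the oligo length yields the partially
overlapping representation described above, while a step equal to
the length yields the non-overlapping one.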

Microarray-based analysis allows the discovery of broad patterns
of genetic activity, providing new understanding of gene
functions and generating novel and unexpected insight into
transcriptional processes and biological mechanisms. The use of
addressable arrays comprising sequences first disclosed in SEQ ID
NOS: 1-7 provides detailed information about transcriptional
changes involved in a specific pathway, potentially leading to
the identification of novel components or gene functions that
manifest themselves as novel phenotypes.

Probes consisting of sequences first disclosed in SEQ ID
NOS: 1-7 can also be used in the identification, selection and
validation of novel molecular targets for drug discovery. The
use of these unique sequences permits the direct confirmation of
drug targets and recognition of drug dependent changes in gene
expression that are modulated through pathways distinct from the
drug's intended target. These unique sequences therefore also
have utility in defining and monitoring both drug action and
toxicity.




WO 01/49864 PCT/US01/00548 

oligos). These nucleic acid molecules may encode or act as NHP
gene antisense molecules, useful, for example, in NHP gene
regulation (for and/or as antisense primers in amplification
reactions of NHP nucleic acid sequences). With respect to NHP
gene regulation, such techniques can be used to regulate
biological functions. Further, such sequences may be used as
part of ribozyme and/or triple helix sequences that are also
useful for NHP gene regulation.

Inhibitory antisense or double stranded oligonucleotides can
additionally comprise at least one modified base moiety which is
selected from the group including but not limited to
5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil,
hypoxanthine, xanthine, 4-acetylcytosine,
5-(carboxyhydroxylmethyl) uracil,
5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil,
beta-D-galactosylqueosine, inosine, N6-isopentenyladenine,
1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine,
5-methylcytosine, N6-adenine, 7-methylguanine,
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil,
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine,
uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil,
4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid
methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and
2,6-diaminopurine.

The antisense oligonucleotide can also comprise at least one
modified sugar moiety selected from the group including but not
limited to arabinose, 2-fluoroarabinose, xylulose, and hexose.

In yet another embodiment, the antisense oligonucleotide will
comprise at least one modified phosphate backbone selected from
the group consisting of a phosphorothioate, a phosphorodithioate,
a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a
methylphosphonate, an alkyl phosphotriester, and a formacetal or
analog thereof.

In yet another embodiment, the antisense oligonucleotide is an
α-anomeric oligonucleotide. An α-anomeric oligonucleotide forms
specific double-stranded hybrids with complementary RNA in which,
contrary to the usual β-units, the strands run parallel to each
other (Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641). The
oligonucleotide is a 2'-O-methylribonucleotide (Inoue et al.,
1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA
analogue (Inoue et al., 1987, FEBS Lett. 215:327-330).
Alternatively, double stranded RNA can be used to disrupt the
expression and function of a targeted NHP.

Oligonucleotides of the invention can be synthesized by standard
methods known in the art, e.g. by use of an automated DNA
synthesizer (such as are commercially available from Biosearch,
Applied Biosystems, etc.). As examples, phosphorothioate
oligonucleotides can be synthesized by the method of Stein et al.
(1988, Nucl. Acids Res. 16:3209), and methylphosphonate
oligonucleotides can be prepared by use of controlled pore glass
polymer supports (Sarin et al., 1988, Proc. Natl. Acad. Sci.
U.S.A. 85:7448-7451), etc.

Low stringency conditions are well known to those of skill in the
art, and will vary predictably depending on the specific
organisms from which the library and the labeled sequences are
derived. For guidance regarding such conditions see, for
example, Sambrook et al., 1989, Molecular Cloning, A Laboratory
Manual (and periodic updates thereof), Cold Spring Harbor Press,
N.Y.; and Ausubel et al., 1989, Current Protocols in Molecular
Biology, Green Publishing Associates and Wiley Interscience, N.Y.

Alternatively, suitably labeled NHP nucleotide probes can be used
to screen a human genomic library using appropriately stringent
conditions or by PCR. The identification and characterization of
human genomic clones is helpful for identifying polymorphisms
(including, but not limited to, nucleotide repeats,
microsatellite alleles, single nucleotide polymorphisms, or
coding single nucleotide polymorphisms), determining the genomic
structure of a given locus/allele, and designing diagnostic
tests. For example, sequences derived from regions adjacent to
the intron/exon boundaries of the human gene can be used to
design primers for use in amplification assays to detect
mutations within the exons, introns, splice sites (e.g., splice
acceptor and/or donor sites), etc., that can be used in
diagnostics and pharmacogenomics.

Further, a NHP homolog can be isolated from nucleic acid from an
organism of interest by performing PCR using two degenerate or
"wobble" oligonucleotide primer pools designed on the basis of
amino acid sequences within the NHP products disclosed herein.
The template for the reaction may be total RNA, mRNA, and/or cDNA
obtained by reverse transcription of mRNA prepared from human or
non-human cell lines or tissue known or suspected to express an
allele of a NHP gene.
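
The idea of a degenerate primer pool designed from an amino acid
sequence can be sketched in Python as follows. The sketch assumes
the standard genetic code and enumerates only the sense-strand
pool for a short peptide; a reverse pool would be the reverse
complements of a pool designed from a downstream peptide. The
function names and the example peptides are hypothetical and are
not drawn from the Sequence Listing.

```python
from itertools import product
from math import prod

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def codons_for(aa: str) -> list:
    """All DNA codons that encode the one-letter amino acid `aa`."""
    codons = [codon for codon, residue in CODON_TABLE.items() if residue == aa]
    if not codons:
        raise ValueError(f"unknown amino acid: {aa!r}")
    return codons


def pool_size(peptide: str) -> int:
    """Number of distinct primers needed to cover every encoding of `peptide`."""
    return prod(len(codons_for(aa)) for aa in peptide)


def primer_pool(peptide: str) -> list:
    """Enumerate every sense-strand DNA sequence that encodes `peptide`."""
    return ["".join(p) for p in product(*(codons_for(aa) for aa in peptide))]


if __name__ == "__main__":
    print(pool_size("MWHK"))   # low-degeneracy peptide
    print(pool_size("LSR"))    # high-degeneracy peptide
    print(primer_pool("MH"))
```

Peptides rich in methionine and tryptophan give small pools,
whereas leucine, serine, and arginine each multiply the pool size
by six, which is one reason primer sites are usually chosen from
low-degeneracy stretches.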

The PCR product can be subcloned and sequenced to ensure that the
amplified sequences represent the sequence of the desired NHP
gene. The PCR fragment can then be used to isolate a full length
cDNA clone by a variety of methods. For example, the amplified
fragment can be labeled and used to screen a cDNA library, such
as a bacteriophage cDNA library. Alternatively, the labeled
fragment can be used to isolate genomic clones via the screening
of a genomic library.

PCR technology can also be used to isolate full length cDNA
sequences. For example, RNA can be isolated, following standard
procedures, from an appropriate cellular or tissue source (i.e.,
one known, or suspected, to express a NHP sequence). A reverse
transcription (RT) reaction can be performed on the RNA using an
oligonucleotide primer specific for the most 5' end of the
amplified fragment for the priming of first strand synthesis.
The resulting RNA/DNA hybrid may then be "tailed" using a
standard terminal transferase reaction, the hybrid may be
digested with RNase H, and second strand synthesis may then be
primed with a complementary primer. Thus, cDNA sequences
upstream of the amplified fragment can be isolated. For a review
of cloning strategies that can be used, see e.g., Sambrook et
al., 1989, supra.

A cDNA encoding a mutant NHP gene can be isolated, for example,
by using PCR. In this case, the first cDNA strand may be
synthesized by hybridizing an oligo-dT oligonucleotide to mRNA
isolated from tissue known or suspected to be expressed in an
individual putatively carrying a mutant NHP allele, and by
extending the new strand with reverse transcriptase. The second
strand of the cDNA is then synthesized using an oligonucleotide
that hybridizes specifically to the 5' end of the normal gene.
Using these two primers, the product is then amplified via PCR,
optionally cloned into a suitable vector, and subjected to DNA
sequence analysis through methods well known to those of skill in
the art. By comparing the DNA sequence of the mutant NHP allele
to that of a corresponding normal NHP allele, the mutation(s)
responsible for the loss or alteration of function of the mutant
NHP gene product can be ascertained.

Alternatively, a genomic library can be constructed using DNA
obtained from an individual suspected of or known to carry a
mutant NHP allele (e.g., a person manifesting a NHP-associated
phenotype such as, for example, obesity, high blood pressure,
connective tissue disorders, infertility, etc.), or a cDNA
library can be constructed using RNA from a tissue known, or
suspected, to express a mutant NHP allele. A normal NHP gene, or
any suitable fragment thereof, can then be labeled and used as a
probe to identify the corresponding mutant NHP allele in such
libraries. Clones containing mutant NHP gene sequences can then
be purified and subjected to sequence analysis according to
methods well known to those skilled in the art.

Additionally, an expression library can be constructed utilizing
cDNA synthesized from, for example, RNA isolated from a tissue
known, or suspected, to express a mutant NHP allele in an
individual suspected of or known to carry such a mutant allele.
In this manner, gene products made by the putatively mutant
tissue can be expressed and screened using standard antibody
screening techniques in conjunction with antibodies raised
against normal NHP product, as described below. (For screening
techniques, see, for example, Harlow, E. and Lane, eds., 1988,
"Antibodies: A Laboratory Manual", Cold Spring Harbor Press, Cold
Spring Harbor.) Additionally, screening can be accomplished by
screening with labeled NHP fusion proteins, such as, for example,
alkaline phosphatase-NHP or NHP-alkaline phosphatase fusion
proteins. In cases where a NHP mutation results in an expressed
gene product with altered function (e.g., as a result of a
missense or a frameshift mutation), polyclonal antibodies to NHP
are likely to cross-react with a corresponding mutant NHP gene
product. Library clones detected via their reaction with such
labeled antibodies can be purified and subjected to sequence
analysis according to methods well known in the art.

The invention also encompasses (a) DNA vectors that contain any
of the foregoing NHP coding sequences and/or their complements
(i.e., antisense); (b) DNA expression vectors that contain any of
the foregoing NHP coding sequences operatively associated with a
regulatory element that directs the expression of the coding
sequences (for example, baculovirus as described in U.S. Patent
No. 5,869,336 herein incorporated by reference); (c) genetically
engineered host cells that contain any of the foregoing NHP
coding sequences operatively associated with a regulatory element
that directs the expression of the coding sequences in the host
cell; and (d) genetically engineered host cells that express an
endogenous NHP sequence under the control of an exogenously
introduced regulatory element (i.e., gene activation). As used
herein, regulatory elements include, but are not limited to,
inducible and non-inducible promoters, enhancers, operators and
other elements known to those skilled in the art that drive and
regulate expression. Such regulatory elements include but are
not limited to the human cytomegalovirus (hCMV) immediate early
gene, regulatable, viral elements (particularly retroviral LTR
promoters), the early or late promoters of SV40 adenovirus, the
lac system, the trp system, the TAC system, the TRC system, the
major operator and promoter regions of phage lambda, the control
regions of fd coat protein, the promoter for 3-phosphoglycerate
kinase (PGK), the promoters of acid phosphatase, and the
promoters of the yeast α-mating factors.

The present invention also encompasses antibodies and
anti-idiotypic antibodies (including Fab fragments), antagonists
and agonists of a NHP, as well as compounds or nucleotide
constructs that inhibit expression of a NHP sequence
(transcription factor inhibitors, antisense and ribozyme
molecules, or gene or regulatory sequence replacement
constructs), or promote the expression of a NHP (e.g., expression
constructs in which NHP coding sequences are operatively
associated with expression control elements such as promoters,
promoter/enhancers, etc.).

The NHPs or NHP peptides, NHP fusion proteins, NHP nucleotide
sequences, antibodies, antagonists and agonists can be useful for
the detection of mutant NHPs or inappropriately expressed NHPs
for the diagnosis of disease. The NHP proteins or peptides, NHP
fusion proteins, NHP nucleotide sequences, host cell expression
systems, antibodies, antagonists, agonists and genetically
engineered cells and animals can be used for screening for drugs
(or high throughput screening of combinatorial libraries)
effective in the treatment of the symptomatic or phenotypic
manifestations of perturbing the normal function of a NHP in the
body. The use of engineered host cells and/or animals may offer
an advantage in that such systems allow not only for the
identification of compounds that bind to the endogenous receptor
for a NHP, but can also identify compounds that trigger
NHP-mediated activities or pathways.

Finally, the NHP products can be used as therapeutics. For
example, soluble derivatives such as NHP peptides/domains
corresponding to NHP, NHP fusion protein products (especially
NHP-Ig fusion proteins, i.e., fusions of a NHP, or a domain of a
NHP, to an IgFc), NHP antibodies and anti-idiotypic antibodies
(including Fab fragments), antagonists or agonists (including
compounds that modulate or act on downstream targets in a
NHP-mediated pathway) can be used to directly treat diseases or
disorders. For instance, the administration of an effective
amount of soluble NHP, or a NHP-IgFc fusion protein or an
anti-idiotypic antibody (or its Fab) that mimics the NHP could
activate or effectively antagonize the endogenous NHP receptor.
Nucleotide constructs encoding such NHP products can be used to
genetically engineer host cells to express such products in vivo;
these genetically engineered cells function as "bioreactors" in
the body delivering a continuous supply of a NHP, a NHP peptide,
or a NHP fusion protein to the body. Nucleotide constructs
encoding functional NHP, mutant NHPs, as well as antisense and
ribozyme molecules can also be used in "gene therapy" approaches
for the modulation of NHP expression. Thus, the invention also
encompasses pharmaceutical formulations and methods for treating
biological disorders.

Various aspects of the invention are described in greater detail
in the subsections below.

5.1 THE NHP SEQUENCES
The cDNA sequences (SEQ ID NO: 1, 3, and 5) and the corresponding
deduced amino acid sequences of the described NHP are presented
in the Sequence Listing. SEQ ID NO: 7 describes a NHP ORF as
well as flanking regions. The NHP nucleotides were obtained from
human cDNA libraries using probes and/or primers generated from
human gene trapped sequence tags. Expression analysis has
provided evidence that the described NHPs can be expressed in
human tissue as well as gene trapped human cells. In addition,
the described NHP sequences can contain a variety of
polymorphisms such as at nucleotide 28 of SEQ ID NO: 3 and
nucleotide 55 of SEQ ID NO: 3 which both can be a C or a T and
can give rise to silent mutation at corresponding amino acid
position 10 of SEQ ID NO: 4 or a tyr or his at amino acid
position 19 of SEQ ID NO: 4. The described NHP sequences can
also contain G-A polymorphisms at nucleotide 379 of SEQ ID NO: 3
and nucleotide position 199 of SEQ ID NO: 5 which can give rise
to a corresponding ala or thr at amino acid position 127 of SEQ
ID NO: 4, or residue 67 of SEQ ID NO: 6. The described NHPs
share similarity with trypsin-like proteases, plasminogens, and
acrosins.
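
The relationship between the nucleotide positions and amino acid
positions recited above can be checked with a short Python sketch.
It assumes that nucleotide numbering begins at the first base of
the start codon of each ORF, which is consistent with the
positions given in the text. The specific codons used in the
example (CTG, CAT, GCT) are hypothetical stand-ins chosen only to
reproduce the stated outcomes; the actual codons should be read
from the Sequence Listing.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def codon_position(nt_pos: int):
    """Map a 1-based ORF nucleotide position to
    (1-based codon number, 1-based position within that codon)."""
    if nt_pos < 1:
        raise ValueError("nucleotide positions are 1-based")
    return (nt_pos - 1) // 3 + 1, (nt_pos - 1) % 3 + 1


def variant_effect(codon: str, pos_in_codon: int, alt_base: str):
    """Return (reference amino acid, variant amino acid) for a
    single-base substitution at `pos_in_codon` (1-based)."""
    alt = codon[:pos_in_codon - 1] + alt_base + codon[pos_in_codon:]
    return CODON_TABLE[codon], CODON_TABLE[alt]


if __name__ == "__main__":
    for nt in (28, 55, 379, 199):
        print(nt, codon_position(nt))
    # Hypothetical codons illustrating each class of change:
    print(variant_effect("CTG", 1, "T"))  # C/T at first base, silent (leu)
    print(variant_effect("CAT", 1, "T"))  # C/T at first base, his vs tyr
    print(variant_effect("GCT", 1, "A"))  # G/A at first base, ala vs thr
```

Each recited nucleotide falls on the first base of its codon, which
is why a C/T change can be silent in a leucine codon yet switch
histidine to tyrosine elsewhere, and why a G/A change switches
alanine to threonine.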

5.2 NHPs AND NHP POLYPEPTIDES
NHPs, polypeptides, peptide fragments, mutated, truncated, or
deleted forms of the NHPs, and/or NHP fusion proteins can be
prepared for a variety of uses. These uses include, but are not
limited to, the generation of antibodies, as reagents in
diagnostic assays, for the identification of other cellular gene
products related to a NHP, as reagents in assays for screening
for compounds that can be used as pharmaceutical reagents useful
in the therapeutic treatment of mental, biological, or medical
disorders and disease.

The Sequence Listing discloses the amino acid sequence encoded by
the described NHP polynucleotides. The NHPs display initiator
methionines in DNA sequence contexts consistent with a
translation initiation site, and display a consensus signal
sequence.

The NHP amino acid sequences of the invention include the amino
acid sequences presented in the Sequence Listing as well as
analogues and derivatives thereof, as well as any oligopeptide
sequence of at least about 10-40, generally about 12-35, or about
16-30 amino acids in length first disclosed in the Sequence
Listing. Further, corresponding NHP homologues from other
species are encompassed by the invention. In fact, any NHP
encoded by the NHP nucleotide sequences described above is within
the scope of the invention, as are any novel polynucleotide
sequences encoding all or any novel portion of an amino acid
sequence presented in the Sequence Listing. The degenerate
nature of the genetic code is well known, and, accordingly, each
amino acid presented in the Sequence Listing is generically
representative of the well known nucleic acid "triplet" codon, or
in many cases codons, that can encode the amino acid. As such,
as contemplated herein, the amino acid sequences presented in the
Sequence Listing,






Purification or enrichment of NHP from such expression systems
can be accomplished using appropriate detergents and lipid
micelles and methods well known to those skilled in the art.
However, such engineered host cells themselves may be used in
situations where it is important not only to retain the
structural and functional characteristics of the NHP, but to
assess biological activity, e.g., in drug screening assays.

The expression systems that may be used for purposes of the
invention include but are not limited to microorganisms such as
bacteria (e.g., E. coli, B. subtilis) transformed with
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA
expression vectors containing NHP nucleotide sequences; yeast
(e.g., Saccharomyces, Pichia) transformed with recombinant yeast
expression vectors containing NHP encoding nucleotide sequences;
insect cell systems infected with recombinant virus expression
vectors (e.g., baculovirus) containing NHP sequences; plant cell
systems infected with recombinant virus expression vectors (e.g.,
cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or
transformed with recombinant plasmid expression vectors (e.g., Ti
plasmid) containing NHP nucleotide sequences; or mammalian cell
systems (e.g., COS, CHO, BHK, 293, 3T3) harboring recombinant
expression constructs containing promoters derived from the
genome of mammalian cells (e.g., metallothionein promoter) or
from mammalian viruses (e.g., the adenovirus late promoter; the
vaccinia virus 7.5K promoter).

In bacterial systems, a number of expression vectors may
be advantageously selected depending upon the use intended for
the NHP product being expressed. For example, when a large
quantity of such a protein is to be produced for the
generation of pharmaceutical compositions of or containing
NHP, or for raising antibodies to a NHP, vectors that direct
the expression of high levels of fusion protein products that
are readily purified may be desirable. Such vectors include,
but are not limited to, the E. coli expression vector pUR278
(Ruther et al., 1983, EMBO J. 2:1791), in which a NHP coding
sequence may be ligated individually into the vector in frame
with the lacZ coding region so that a fusion protein is
produced; pIN vectors (Inouye & Inouye, 1985, Nucleic Acids
Res. 13:3101-3109; Van Heeke & Schuster, 1989, J. Biol. Chem.
264:5503-5509); and the like. pGEX vectors (Pharmacia or
American Type Culture Collection) can also be used to express
foreign polypeptides as fusion proteins with glutathione
S-transferase (GST). In general, such fusion proteins are
soluble and can easily be purified from lysed cells by
adsorption to glutathione-agarose beads followed by elution in
the presence of free glutathione. The pGEX vectors are
designed to include thrombin or factor Xa protease cleavage
sites so that the cloned target sequence product can be
released from the GST moiety.
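
By way of illustration only, the release of a cloned target from a GST fusion at an engineered protease site can be sketched in Python as follows. The recognition motifs shown (Leu-Val-Pro-Arg-Gly-Ser for thrombin, Ile-Glu-Gly-Arg for factor Xa) are the commonly cited ones and are assumptions, since they are not recited above; the function name `release_target` and the short tag sequence standing in for the GST moiety are hypothetical.

```python
from typing import Tuple

# Commonly cited recognition motifs (assumed; not recited in the text above).
# Each entry is (motif, number of motif residues that stay on the N-terminal side).
CLEAVAGE_SITES = {
    "thrombin": ("LVPRGS", 4),   # cut between Arg and Gly
    "factor_xa": ("IEGR", 4),    # cut after Arg
}


def release_target(fusion: str, protease: str) -> Tuple[str, str]:
    """Split a fusion protein at the first site for the given protease.

    Returns (tag_side, released_target) as one-letter amino acid strings.
    """
    motif, offset = CLEAVAGE_SITES[protease]
    pos = fusion.find(motif)
    if pos < 0:
        raise ValueError(f"no {protease} site found in fusion protein")
    cut = pos + offset
    return fusion[:cut], fusion[cut:]


if __name__ == "__main__":
    tag = "MSPILGYWK"       # hypothetical stand-in for the GST moiety
    target = "MWTAVIGTNN"   # first ten residues of SEQ ID NO: 2
    tag_side, released = release_target(tag + "LVPRGS" + target, "thrombin")
    print(tag_side)
    print(released)
```

Under these assumptions, thrombin cleavage leaves two linker residues (Gly-Ser) on the released target, whereas a factor Xa site placed immediately before the target leaves none.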

In an insect system, Autographa californica nuclear
polyhedrosis virus (AcNPV) is used as a vector to express
foreign sequences. The virus grows in Spodoptera frugiperda
cells. A NHP coding sequence can be cloned individually into
non-essential regions (for example the polyhedrin gene) of the
virus and placed under control of an AcNPV promoter (for
example the polyhedrin promoter). Successful insertion of NHP
coding sequence will result in inactivation of the polyhedrin
gene and production of non-occluded recombinant virus (i.e.,
virus lacking the proteinaceous coat coded for by the
polyhedrin gene). These recombinant viruses are then used to
infect Spodoptera frugiperda cells in which the inserted
sequence is expressed (e.g., see Smith et al., 1983, J.
Virol. 46:584; Smith, U.S. Patent No. 4,215,051).

In mammalian host cells, a number of viral-based
expression systems may be utilized. In cases where an
adenovirus is used as an expression vector, the NHP nucleotide
sequence of interest may be ligated to an adenovirus
transcription/translation control complex, e.g., the late
promoter and tripartite leader sequence. This chimeric
sequence may then be inserted in the adenovirus genome by in
vitro or in vivo recombination. Insertion in a non-essential
region of the viral genome (e.g., region E1 or E3) will result
in a recombinant virus that is viable and capable of
expressing a NHP product in infected hosts (e.g., see Logan &
Shenk, 1984, Proc. Natl. Acad. Sci. USA 81:3655-3659).
Specific initiation signals may also be required for efficient
translation of inserted NHP nucleotide sequences. These
signals include the ATG initiation codon and adjacent
sequences. In cases where an entire NHP gene or cDNA,
including its own initiation codon and adjacent sequences, is
inserted into the appropriate expression vector, no additional
translational control signals may be needed. However, in
cases where only a portion of a NHP coding sequence is
inserted, exogenous translational control signals, including,
perhaps, the ATG initiation codon, must be provided.
Furthermore, the initiation codon must be in phase with the
reading frame of the desired coding sequence to ensure
translation of the entire insert. These exogenous
translational control signals and initiation codons can be of
a variety of origins, both natural and synthetic. The
efficiency of expression can be enhanced by the inclusion of
appropriate transcription enhancer elements, transcription
terminators, etc. (See Bittner et al., 1987, Methods in
Enzymol. 153:516-544).
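
The requirement that an exogenous initiation codon be in phase with the reading frame of the insert can be illustrated with a short Python sketch. This is an illustration only: the leader sequences and the function names `start_is_in_phase` and `codons_before_stop` are hypothetical, and the insert is a short fragment taken from SEQ ID NO: 1 (codons 2 through 7).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def start_is_in_phase(construct: str, atg_index: int, insert_start: int) -> bool:
    """True if the ATG at atg_index shares a reading frame with the insert."""
    if construct[atg_index:atg_index + 3].upper() != "ATG":
        raise ValueError("no ATG initiation codon at the given position")
    return (insert_start - atg_index) % 3 == 0


def codons_before_stop(construct: str, atg_index: int) -> int:
    """Count codons read from the ATG until a stop codon or the end of the construct."""
    seq = construct.upper()
    count = 0
    for i in range(atg_index, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            break
        count += 1
    return count


if __name__ == "__main__":
    insert = "TGGACAGCTGTGATTGGA"   # codons 2-7 of SEQ ID NO: 1
    in_phase = "ATGGCC" + insert    # hypothetical 6-nt leader supplying the ATG
    shifted = "ATGGC" + insert      # hypothetical 5-nt leader, one base short
    print(start_is_in_phase(in_phase, 0, 6), codons_before_stop(in_phase, 0))
    print(start_is_in_phase(shifted, 0, 5), codons_before_stop(shifted, 0))
```

In the shifted construct the ribosome would read the insert out of frame and meet a TGA stop codon after five codons, so the entire insert is not translated.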

In addition, a host cell strain may be chosen that
modulates the expression of the inserted sequences, or
modifies and processes the gene product in the specific
fashion desired. Such modifications (e.g., glycosylation) and
processing (e.g., cleavage) of protein products may be
important for the function of the protein. Different host
cells have characteristic and specific mechanisms for the
post-translational processing and modification of proteins and
gene products. Appropriate cell lines or host systems can be
chosen to ensure the correct modification and processing of
the foreign protein expressed. To this end, eukaryotic host
cells which possess the cellular machinery for proper
processing of the primary transcript, glycosylation, and
phosphorylation of the gene product may be used. Such
mammalian host cells include, but are not limited to, CHO,
VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and in particular,
human cell lines.

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For example, cell 
lines which stably express the NHP sequences described above 
can be engineered. Rather than using expression vectors which 
contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences, 
transcription terminators, polyadenylation sites, etc.), and a 
selectable marker. Following the introduction of the foreign 
DNA, engineered cells may be allowed to grow for 1-2 days in
enriched media, and are then switched to selective media.
The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow to form 
foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines 
which express the NHP product. Such engineered cell lines may 
be particularly useful in screening and evaluation of 
compounds that affect the endogenous activity of the NHP
product.

A number of selection systems may be used, including but
not limited to the herpes simplex virus thymidine kinase
(Wigler, et al., 1977, Cell 11:223), hypoxanthine-guanine
phosphoribosyltransferase (Szybalska & Szybalski, 1962, Proc.
Natl. Acad. Sci. USA 48:2026), and adenine
phosphoribosyltransferase (Lowy, et al., 1980, Cell 22:817)
genes, which can be employed in tk-, hgprt- or aprt- cells,
respectively. Also, antimetabolite resistance can be used as
the basis of selection for the following genes: dhfr, which
confers resistance to methotrexate (Wigler, et al., 1980,
Proc. Natl. Acad. Sci. USA 77:3567; O'Hare, et al., 1981, Proc.
Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance
to mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad.
Sci. USA 78:2072); neo, which confers resistance to the
aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J. Mol.
Biol. 150:1); and hygro, which confers resistance to
hygromycin (Santerre, et al., 1984, Gene 30:147).
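
For convenience, the selection systems listed above can be summarized as a small lookup table. The following Python sketch only restates the pairings given in the preceding paragraph; the class name `SelectionSystem`, its field names, and the helper `selection_basis` are hypothetical.

```python
from typing import List, NamedTuple, Optional


class SelectionSystem(NamedTuple):
    gene: str
    host_cells: Optional[str]      # deficient host in which the gene is employed
    resistance_to: Optional[str]   # selective agent, for resistance markers


# Pairings as recited in the paragraph above.
SELECTION_SYSTEMS: List[SelectionSystem] = [
    SelectionSystem("herpes simplex virus thymidine kinase", "tk-", None),
    SelectionSystem("hypoxanthine-guanine phosphoribosyltransferase", "hgprt-", None),
    SelectionSystem("adenine phosphoribosyltransferase", "aprt-", None),
    SelectionSystem("dhfr", None, "methotrexate"),
    SelectionSystem("gpt", None, "mycophenolic acid"),
    SelectionSystem("neo", None, "G-418"),
    SelectionSystem("hygro", None, "hygromycin"),
]


def selection_basis(gene: str) -> str:
    """Describe how the named gene is used for selection."""
    for system in SELECTION_SYSTEMS:
        if system.gene == gene:
            if system.resistance_to is not None:
                return f"resistance to {system.resistance_to}"
            return f"use in {system.host_cells} cells"
    raise KeyError(gene)


if __name__ == "__main__":
    for system in SELECTION_SYSTEMS:
        print(f"{system.gene}: {selection_basis(system.gene)}")
```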

Alternatively, any fusion protein can be readily purified
by utilizing an antibody specific for the fusion protein being
expressed. For example, a system described by Janknecht et
al. allows for the ready purification of non-denatured fusion
proteins expressed in human cell lines (Janknecht, et al.,
1991, Proc. Natl. Acad. Sci. USA 88:8972-8976). In this
system, the sequence of interest is subcloned into a vaccinia
recombination plasmid such that the sequence's open reading
frame is translationally fused to an amino-terminal tag
consisting of six histidine residues. Extracts from cells
infected with recombinant vaccinia virus are loaded onto
Ni2+-nitriloacetic acid-agarose columns and histidine-tagged
proteins are selectively eluted with imidazole-containing
buffers.

Also encompassed by the present invention are fusion
proteins that direct the NHP to a target organ and/or
facilitate transport across the membrane into the cytosol.
Conjugation of NHPs to antibody molecules or their Fab
fragments could be used to target cells bearing a particular
epitope. Attaching the appropriate signal sequence to the NHP
would also transport the NHP to the desired location within
the cell. Alternatively, targeting of NHP or its nucleic acid
sequence might be achieved using liposome or lipid complex
based delivery systems. Such technologies are described in
Liposomes: A Practical Approach, New, R.R.C., ed., Oxford University
Press, New York and in U.S. Patents Nos. 4,594,595, 5,459,127,
5,948,767 and 6,110,490 and their respective disclosures which
are herein incorporated by reference in their entirety.
Additionally embodied are novel protein constructs engineered
in such a way that they facilitate transport of the NHP to the
target site or desired organ, where they cross the cell
membrane and/or the nucleus where the NHP can exert its
functional activity. This goal may be achieved by coupling of
the NHP to a cytokine or other ligand that provides targeting
specificity, and/or to a protein transducing domain (see
generally U.S. applications Ser. No. 60/111,701 and
60/056,713, both of which are herein incorporated by
reference, for examples of such transducing sequences) to
facilitate passage across cellular membranes and can
optionally be engineered to include nuclear localization
sequences.

5.3 ANTIBODIES TO NHP PRODUCTS 
Antibodies that specifically recognize one or more
epitopes of a NHP, or epitopes of conserved variants of a NHP,
or peptide fragments of a NHP are also encompassed by the
invention. Such antibodies include but are not limited to
polyclonal antibodies, monoclonal antibodies (mAbs), humanized
or chimeric antibodies, single chain antibodies, Fab
fragments, F(ab')2 fragments, fragments produced by a Fab
expression library, anti-idiotypic (anti-Id) antibodies, and
epitope-binding fragments of any of the above.

The antibodies of the invention may be used, for example,
in the detection of NHP in a biological sample and may,
therefore, be utilized as part of a diagnostic or prognostic
technique whereby patients may be tested for abnormal amounts
of NHP. Such antibodies may also be utilized in conjunction
with, for example, compound screening schemes for the
evaluation of the effect of test compounds on expression
and/or activity of a NHP gene product. Additionally, such
antibodies can be used in conjunction with gene therapy to, for
example, evaluate the normal and/or engineered NHP-expressing
cells prior to their introduction into the patient. Such
antibodies may additionally be used as a method for the
inhibition of abnormal NHP activity. Thus, such antibodies
may be utilized as part of treatment methods.

For the production of antibodies, various host animals
may be immunized by injection with the NHP, an NHP peptide
(e.g., one corresponding to a functional domain of an NHP),
truncated NHP polypeptides (NHP in which one or more domains
have been deleted), functional equivalents of the NHP or
mutated variants of the NHP. Such host animals may include but
are not limited to pigs, rabbits, mice, goats, and rats, to
name but a few. Various adjuvants may be used to increase the
immunological response, depending on the host species,
including but not limited to Freund's adjuvant (complete and
incomplete), mineral salts such as aluminum hydroxide or
aluminum phosphate, surface active substances such as
lysolecithin, pluronic polyols, polyanions, peptides, oil
emulsions, and potentially useful human adjuvants such as BCG
(bacille Calmette-Guerin) and Corynebacterium parvum.
Alternatively, the immune response could be enhanced by
combination and/or coupling with molecules such as keyhole
limpet hemocyanin, tetanus toxoid, diphtheria toxoid,
ovalbumin, cholera toxin or fragments thereof. Polyclonal
antibodies are heterogeneous populations of antibody molecules
derived from the sera of the immunized animals.

Monoclonal antibodies, which are homogeneous populations 
of antibodies to a particular antigen, can be obtained by any 
technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include, 
but are not limited to, the hybridoma technique of Kohler and
Milstein (1975, Nature 256:495-497; and U.S. Patent No.
4,376,110), the human B-cell hybridoma technique (Kosbor et
al., 1983, Immunology Today 4:72; Cole et al., 1983, Proc.
Natl. Acad. Sci. USA 80:2026-2030), and the EBV-hybridoma
technique (Cole et al., 1985, Monoclonal Antibodies And Cancer
Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may
be of any immunoglobulin class including IgG, IgM, IgE, IgA, 
IgD and any subclass thereof. The hybridoma producing the mAb 
of this invention may be cultivated in vitro or in vivo. 
Production of high titers of mAbs in vivo makes this the 
presently preferred method of production. 

In addition, techniques developed for the production of
"chimeric antibodies" (Morrison et al., 1984, Proc. Natl.
Acad. Sci., 81:6851-6855; Neuberger et al., 1984, Nature,
312:604-608; Takeda et al., 1985, Nature, 314:452-454) by
splicing the genes from a mouse antibody molecule of
appropriate antigen specificity together with genes from a
human antibody molecule of appropriate biological activity can
be used. A chimeric antibody is a molecule in which different
portions are derived from different animal species, such as
those having a variable region derived from a murine mAb and a
human immunoglobulin constant region. Such technologies are
described in U.S. Patents Nos. 6,075,181 and 5,877,397 and
their respective disclosures which are herein incorporated by
reference in their entirety.

Alternatively, techniques described for the production of
single chain antibodies (U.S. Patent 4,946,778; Bird, 1988,
Science 242:423-426; Huston et al., 1988, Proc. Natl. Acad.
Sci. USA 85:5879-5883; and Ward et al., 1989, Nature 334:544-546)
can be adapted to produce single chain antibodies against
NHP sequence products. Single chain antibodies are formed by
linking the heavy and light chain fragments of the Fv region
via an amino acid bridge, resulting in a single chain
polypeptide.

Antibody fragments which recognize specific epitopes may
be generated by known techniques. For example, such fragments
include, but are not limited to: the F(ab')2 fragments which
can be produced by pepsin digestion of the antibody molecule
and the Fab fragments which can be generated by reducing the
disulfide bridges of the F(ab')2 fragments. Alternatively, Fab
expression libraries may be constructed (Huse et al., 1989,
Science, 246:1275-1281) to allow rapid and easy identification
of monoclonal Fab fragments with the desired specificity.

Antibodies to a NHP can, in turn, be utilized to generate
anti-idiotype antibodies that "mimic" a given NHP, using
techniques well known to those skilled in the art. (See,
e.g., Greenspan & Bona, 1993, FASEB J 7(5):437-444; and
Nissinoff, 1991, J. Immunol. 147(8):2429-2438). For example,
antibodies which bind to a NHP domain and competitively
inhibit the binding of NHP to its cognate receptor can be used
to generate anti-idiotypes that "mimic" the NHP and,
therefore, bind and activate or neutralize a receptor. Such
anti-idiotypic antibodies or Fab fragments of such
anti-idiotypes can be used in therapeutic regimens involving a NHP
signaling pathway.

The present invention is not to be limited in scope by
the specific embodiments described herein, which are intended
as single illustrations of individual aspects of the
invention, and functionally equivalent methods and components
are within the scope of the invention. Indeed, various
modifications of the invention, in addition to those shown and
described herein will become apparent to those skilled in the
art from the foregoing description. Such modifications are
intended to fall within the scope of the appended claims. All
cited publications, patents, and patent applications are
herein incorporated by reference in their entirety.

WHAT IS CLAIMED IS:

1. An isolated nucleic acid molecule comprising at
least 24 contiguous bases of nucleotide sequence first
disclosed in the NHP sequence described in SEQ ID NO: 3.

2. An isolated nucleic acid molecule comprising a
nucleotide sequence that:
(a) encodes the amino acid sequence shown in SEQ ID
NO: 4; and
(b) hybridizes under stringent conditions to the
nucleotide sequence of SEQ ID NO: 3 or the
complement thereof.

3. An isolated nucleic acid molecule encoding the
amino acid sequence described in SEQ ID NO: 6.



4. An isolated nucleic acid molecule encoding the
amino acid sequence described in SEQ ID NO: 2.

SEQUENCE LISTING

<110> Walke, D. Wade
      Wilganowski, Nathaniel L.
      Donoho, Gregory
      Turner, C. Alexander Jr.

<120> Novel Human Proteases and Polynucleotides Encoding the Same

<130> LEX-0114-PCT

<150> US 60/174,686 
<151> 2000-01-06 

<160> 7 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 654 
<212> DNA 
<213> Homo sapien 

<400> 1

atgtggacag ctgtgattgg aactaataat atacatggac gctatcctca taccaagaag 60

ataaaaatta aagcaatcat tattcatcca aacttcattt tggaatctta tgtaaatgat 120

attgcacttt ttcacttaaa aaaagcagtg aggtataatg actatattca gcctatttgc 180

ctaccttttg atgttttcca aatcctggac ggaaacacaa agtgttttat aagtggctgg 240

ggaagaacaa aagaagaagg taacgctaca aatattttac aagatgcaga agtgcattat 300

atttctcgag agatgtgtaa ttctgagagg agttatgggg gaataattcc taacacttca 360

ttttgtgcag gtgatgaaga tggagctttt gatacttgca ggggtgacag tgggggacca 420

ttaatgtgct acttaccaga atataaaaga ttttttgtaa tgggaattac cagttacgga 480

catggctgtg gtcgaagagg ttttcctggt gtctatattg ggccatcctt ctaccaaaag 540

tggctgacag agcatttctt ccatgcaagc actcaaggca tacttactat aaatatttta 600

cgtggccaga tcctcatagc tttatgtttt gtcatcttac tagcaacaac ataa 654

<210> 2
<211> 217
<212> PRT
<213> Homo sapien

<400> 2

Met Trp Thr Ala Val Ile Gly Thr Asn Asn Ile His Gly Arg Tyr Pro
1               5                   10                  15

His Thr Lys Lys Ile Lys Ile Lys Ala Ile Ile Ile His Pro Asn Phe
            20                  25                  30

Ile Leu Glu Ser Tyr Val Asn Asp Ile Ala Leu Phe His Leu Lys Lys
        35                  40                  45

Ala Val Arg Tyr Asn Asp Tyr Ile Gln Pro Ile Cys Leu Pro Phe Asp
    50                  55                  60

Val Phe Gln Ile Leu Asp Gly Asn Thr Lys Cys Phe Ile Ser Gly Trp
65                  70                  75                  80

Gly Arg Thr Lys Glu Glu Gly Asn Ala Thr Asn Ile Leu Gln Asp Ala
                85                  90                  95

Glu Val His Tyr Ile Ser Arg Glu Met Cys Asn Ser Glu Arg Ser Tyr
            100                 105                 110

Gly Gly Ile Ile Pro Asn Thr Ser Phe Cys Ala Gly Asp Glu Asp Gly
        115                 120                 125

Ala Phe Asp Thr Cys Arg Gly Asp Ser Gly Gly Pro Leu Met Cys Tyr
    130                 135                 140

Leu Pro Glu Tyr Lys Arg Phe Phe Val Met Gly Ile Thr Ser Tyr Gly
145                 150                 155                 160

His Gly Cys Gly Arg Arg Gly Phe Pro Gly Val Tyr Ile Gly Pro Ser
                165                 170                 175

Phe Tyr Gln Lys Trp Leu Thr Glu His Phe Phe His Ala Ser Thr Gln
            180                 185                 190

Gly Ile Leu Thr Ile Asn Ile Leu Arg Gly Gln Ile Leu Ile Ala Leu
        195                 200                 205

Cys Phe Val Ile Leu Leu Ala Thr Thr
    210                 215



<210> 3 
<211> 1047 
<212> DNA 
<213> Homo sapien 

<400> 3 

atgcggctgg ggctcctgag cgtggcgytg ttgtttgtgg ggagctctca cttayactca 60

gaccactact cgccctctgg aaggcacagg ctcggcccct cgccggaacc ggcggctagt 120

tcccagcagg ctgaggccgt ccgcaagagg ctccggcggc ggagggaggg aggggcgcat 180

gcaaaggatt gtggaacagc accgcttaag gatgtgttgc aagggtctcg gattataggg 240

ggcaccgaag cacaagctgg cgcatggccg tgggtggtga gcctgcagat taaatatggc 300

cgtgttcttg ttcatgtatg tgggggaacc ctagtgagag agaggtgggt cctcacagct 360

gcccactgca ctaaagacrc tagcgatcct ttaatgtgga cagctgtgat tggaactaat 420

aatatacatg gacgctatcc tcataccaag aagataaaaa ttaaagcaat cattattcat 480

ccaaacttca ttttggaatc ttatgtaaat gatattgcac tttttcactt aaaaaaagca 540

gtgaggtata atgactatat tcagcctatt tgcctacctt ttgatgtttt ccaaatcctg 600

gacggaaaca caaagtgttt tataagtggc tggggaagaa caaaagaaga aggtaacgct 660

acaaatattt tacaagatgc agaagtgcat tatatttctc gagagatgtg taattctgag 720

aggagttatg ggggaataat tcctaacact tcattttgtg caggtgatga agatggagct 780

tttgatactt gcaggggtga cagtggggga ccattaatgt gctacttacc agaatataaa 840

agattttttg taatgggaat taccagttac ggacatggct gtggtcgaag aggttttcct 900

ggtgtctata ttgggccatc cttctaccaa aagtggctga cagagcattt cttccatgca 960

agcactcaag gcatacttac tataaatatt ttacgtggcc agatcctcat agctttatgt 1020

tttgtcatct tactagcaac aacataa 1047



<210> 4 
<210> 5
<211> 867
<212> DNA
<213> Homo sapien

<400> 5

atgcaaaatt gtggaacagc accgcttaag gatgtgttgc aagggtctcg gattataggg 60

ggcaccgaag cacaagctgg cgcatggccg tgggtggtga gcctgcagat taaatatggc 120

cgtgttcttg ttcatgtatg tgggggaacc ctagtgagag agaggtgggt cctcacagct 180

gcccactgca ctaaagacrc tagcgatcct ttaatgtgga cagctgtgat tggaactaat 240

aatatacatg gacgctatcc tcataccaag aagataaaaa ttaaagcaat cattattcat 300

ccaaacttca ttttggaatc ttatgtaaat gatattgcac tttttcactt aaaaaaagca 360

gtgaggtata atgactatat tcagcctatt tgcctacctt ttgatgtttt ccaaatcctg 420

gacggaaaca caaagtgttt tataagtggc tggggaagaa caaaagaaga aggtaacgct 480

acaaatattt tacaagatgc agaagtgcat tatatttctc gagagatgtg taattctgag 540

aggagttatg ggggaataat tcctaacact tcattttgtg caggtgatga agatggagct 600

tttgatactt gcaggggtga cagtggggga ccattaatgt gctacttacc agaatataaa 660

agattttttg taatgggaat taccagttac ggacatggct gtggtcgaag aggttttcct 720

ggtgtctata ttgggccatc cttctaccaa aagtggctga cagagcattt cttccatgca 780

agcactcaag gcatacttac tataaatatt ttacgtggcc agatcctcat agctttatgt 840

tttgtcatct tactagcaac aacataa 867

<210> 6
<211> 288
<212> PRT
<213> Homo sapien

            100                 105                 110

Ala Leu Phe His Leu Lys Lys Ala Val Arg Tyr Asn Asp Tyr Ile Gln
        115                 120                 125

Pro Ile Cys Leu Pro Phe Asp Val Phe Gln Ile Leu Asp Gly Asn Thr
    130                 135                 140

Lys Cys Phe Ile Ser Gly Trp Gly Arg Thr Lys Glu Glu Gly Asn Ala
145                 150                 155                 160

Thr Asn Ile Leu Gln Asp Ala Glu Val His Tyr Ile Ser Arg Glu Met
                165                 170                 175

Cys Asn Ser Glu Arg Ser Tyr Gly Gly Ile Ile Pro Asn Thr Ser Phe
            180                 185                 190

Cys Ala Gly Asp Glu Asp Gly Ala Phe Asp Thr Cys Arg Gly Asp Ser
        195                 200                 205

Gly Gly Pro Leu Met Cys Tyr Leu Pro Glu Tyr Lys Arg Phe Phe Val
    210                 215                 220

Met Gly Ile Thr Ser Tyr Gly His Gly Cys Gly Arg Arg Gly Phe Pro
225                 230                 235                 240

Gly Val Tyr Ile Gly Pro Ser Phe Tyr Gln Lys Trp Leu Thr Glu His
                245                 250                 255

Phe Phe His Ala Ser Thr Gln Gly Ile Leu Thr Ile Asn Ile Leu Arg
            260                 265                 270

Gly Gln Ile Leu Ile Ala Leu Cys Phe Val Ile Leu Leu Ala Thr Thr
        275                 280                 285

<210> 7 
<211> 1286 
<212> DNA 
<213> Homo sapien 

<400> 7 

ttcttccatt tcaggtgtcg tgaaaagctt gaattcggcg cgccagatat cacacgtgcc 60 

aaggggctgg ctcgccgcca tcttgctcac cagcctccaa aatgcggctg gggctcctga 120 

gcgtggcgct gttgtttgtg gggagctctc acttatactc agaccactac tcgccctctg 180 

gaaggcacag gctcggcccc tcgccggaac cggcggctag ttcccagcag gctgaggccg 240 

tccgcaagag gctccggcgg cggagggagg gaggggcgca tgcaaaggat tgtggaacag 300 

caccgcttaa ggatgtgttg caagggtctc ggattatagg gggcaccgaa gcacaagctg 360 

gcgcatggcc gtgggtggtg agcctgcaga ttaaatatgg ccgtgttctt gttcatgtat 420 

gtgggggaac cctagtgaga gagaggtggg tcctcacagc tgcccactgc actaaagacg 480 

ctagcgatcc tttaatgtgg acagctgtga ttggaactaa taatatacat ggacgctatc 540 

ctcataccaa gaagataaaa attaaagcaa tcattattca tccaaacttc attttggaat 600 

cttatgtaaa tgatattgca ctttttcact taaaaaaagc agtgaggtat aatgactata 660 

ttcagcctat ttgcctacct tttgatgttt tccaaatcct ggacggaaac acaaagtgtt 720 

ttataagtgg ctggggaaga acaaaagaag aaggtaacgc tacaaatatt ttacaagatg 780 

cagaagtgca ttatatttct cgagagatgt gtaattctga gaggagttat gggggaataa 840 

ttcctaacac ttcattttgt gcaggtgatg aagatggagc ttttgatact tgcaggggtg 900 

acagtggggg accattaatg tgctacttac cagaatataa aagatttttt gtaatgggaa 960 

ttaccagtta cggacatggc tgtggtcgaa gaggttttcc tggtgtctat attgggccat 1020 

ccttctacca aaagtggctg acagagcatt tcttccatgc aagcactcaa ggcatactta 1080 

ctataaatat tttacgtggc cagatcctca tagctttatg ttttgtcatc ttactagcaa 1140 

caacataaag aaattctgaa ggctttcata tctttatttt gcattgtgtc cctttctatg 1200 

ttctatataa tgaacatcat ttattcttct agcaattaat tgcctacatt agagatttca 1260 

tgtgaacatt ttatgggcta taaata 1286 